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Summary 


Using  methods  which  have  been  employed  for  obtaining  bounds 
on  continuous  failure  distributions,  bounds  on  lattice  distributions 
are  derived  under  the  increasing  (decreasing)  failure  rate  assumptions. 
The  discrete  bounds  are  convenient  in  a  number  of  applications  such 
as  life  table  analysis  and  reliability  theory. 


1.  Introduction 


A  discrete  distribution  of  a  random  variable  X  is  called 
a  lattice  distribution  if  there  exist  real  numbers  a  and  h  >  0 
such  that  every  possible  value  of  X  can  be  represented  in  the 
form  a  +  kh  ,  where  k  runs  through  integer  values.  We  call  h 
the  span  of  the  distribution.  The  geometric,  binomial  and  Poisson 
distributions  are  important  examples  of  lattice  distributions. 

It  seems  that  there  is  essentially  no  literature  on  Chebyshev 
type  bounds  for  lattice  distributions.  Of  course,  the  classical 
Chebyshev  bounds  are  attained  by  discrete  distributions.  However, 
these  need  not  be  lattice.  The  general  method  used  by  Karlin  and 
Studden  and  others  [12,  11]  for  obtaining  Chebyshev  type  bounds 
can,  of  course,  in  theory  be  applied  to  lattice  distributions.  However, 
we  will  restrict  ourselves  to  a  special  class  of  lattice  distributions 
where  these  methods  do  not  provide  sharp  bounds. 

For  convenience  assume  that  a  ■  0  ,  h  ■  1  and  let 
p  ■  P[X  »  k]  ;  (k  ■  0,1,2,....  )  .  Define  the  failure  rate  r(k) 

A 

for  lattice  distributions  as 


r(k) 


» 

2  p 

J* 


J 


if 


v° 


and  note  that  0  £  r(k)  £  1  .  Let 
00 

1  -  F(k)  -  2  p  .  We  say  that  F 
J-k  J 


k-i 

F(k)  ■  2  P  and  F(k)  - 

J-0  J 

is  lattice  IFR  (lattice  DFR) 
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if  and  only  if  r(k)  is  non-decreasing  (non-increasing)  in  k  . 

Note  that  it  is  shown  in  [3]  that  F  is  lattice  IFR  (lattice  DFR) 
iff  log  F(k)  is  concave  (convex).  Further  equivalent  definitions 
of  lattice  IFR  (lattice  DFR)  distributions  in  terms  of  Polya 
frequency  function  of  order  two  and  total  positivity  of  order  two  have 
been  discussed  in  [1,2,3,10].  Note  that  the  negative  binomial 
distribution 


(1-P) 


k-a 


a  >  o 

k  n  0,1,2, . . . . 


is  lattice  IFR  (increasing  failure  rate)  for  Q  >  1  and  lattice  DFR 
(decreasing  failure  rate)  for  a<  1  .  For  a  ■  1  ,  it  is  the 
geometric  distribution  with  a  constant  failure  rate  and  hence  it 
belongs  to  both  IFR  and  DFR  classes.  The  geometric  distribution 
is  the  only  distribution  with  this  property  in  the  discrete  case. 

The  binomial  and  Poisson  distributions  are  lattice  IFR. 

Lattice  distributions  occur  naturally  in  many  applications. 

Fatigue  failure  data  obtained  under  dynamic  loading  is  commonly 
recorded  in  terms  of  the  number  of  cycles  to  failure.  For  many 
structures  subject  to  fatigue  failure,  a  lattice  IFR  distribution 
would  seem  appropriate.  A  mathematical  model  describing  such  types 
of  failure  has  been  discussed  in  [5],  As  another  example  we  mention 
human  mortality  data  where  it  is  common  to  record  time  of  death 
to  the  nearest  yesu*  or  the  nearest  month.  For  certain  types  of 
cancer  mortality  data  a  decreasing  failure  rate  seems  appropriate,  [4] . 

In  any  situation  where  grouped  data  is  derived  from  a  continuous 
IFR(DFR)  distribution,  a  lattice  IFR  (lattice  DFR)  distribution  will  be 


appropriate. 
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2.  Bounds  on  Lattice  Distributions 


2.1  IFR  Bound: 

Let  F  denote  a  left  continuous  distribution  function  such 
that  F(o”)«  0  .  Bounds  on  1-F  under  the  IFR  (DFR)  assumption  in 
the  continuous  case  when  the  first  moment  is  given  can  be 

found  in  [2].  In  the  IFR  case,  the  lower  bound  is  given  by  Theorem  1. 


Theorem  1:  Let  F  be  a  left  continuous  distribution.  Let  F 

be  IFR  with  mean  ,  then 


F(t)  ;> 


;  t  <\i± 
i  1 • 


The  inequality  is  sharp. 

A  similar  bound  exists  for  lattice  distributions  and  we  shall 
discuss  this  after  proving  the  following  lemna. 

Lemma:  Let  F  be  a  lattice  distribution  with  mean  [n  .  Assume 

k  1 

that  F  is  IFR  and  let  s,  ■  I  F(i)  then  for  t  £  ll. 

K  i-1  1 


(1)  F(t)  ^  F(t-k)  (  1  -  F-^I kl )  ;  h  -  1,2,..., t-1  . 

^1  t-k-1 

Proof:  We  shall  prove  (l)  by  induction.  By  definition, 

<x  00 

F(k)  *  i  p  ,  so  that  log  F(k)  ■  log  (  2  p  )  .  Note  that  the 

i*k  1  i-k  1 

function  log  F(k)  is  defined  only  for  integer  values.  By  linear 
interpolation  we  can  define  log  F(k)  for  any  real  value  of  k  as 
shown  in  Figure  2. 1. 
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Since  we  assume  that  F  is  IFR,  it  follows  that  log  F(k) 
is  concave.  Hence  there  exists  a  supporting  line  to  log  F(k) 
shown  in  Figure  2.1. 


as 


Log  F( k) 


Thus  by  log  concavity,  we  have 


log  F(k)  £  (Li+1-  L^k  -  iLi+1+  (i+1)^  ;  k  -  i,i+l#... 
But  *  log  F(i)  and  =  log  F(i+l)  f  therefore 


F(k)  ^ 


a 

L  F(i )  J 


i+1 


mi 

[F(i+1)32 


for  k  =  i,i+l,.. . 


Let  =»  F(i )  +  F(2)  +. ..+  F(k)  where  sQ  =  0  .  Then  we  have 


00 


00 


i-1 

^  ■  I ?(k)  ”  ZF(k)  +  Zf(k) 

k«l  k»l  k»i 


00 


♦  y 

1-1  kti  L  F(i)  J  (Fdri)]1 


am 


i+l 


00 


d-i + y 

^LF(i)  ■■ 


-k-i 


Thus 

^1  <■  8i-l 

obtain 

(2) 

F(i+1) 

_  2 
+  _K±1 


F(i)  -  F(i+1) 


Solving  for  F(i+l)  ,  we 


2 

F(l) 

LWl 


From  (2)  with  i-t-1  ,  it  follows  that  (l)  holds  for  k  -  1  .  Now 
assume  that  (l)  is  true  for  k*m-l  ;  (n  £  t)  i.e. ; 


(3) 


F(t)  ^  Ff^-n+l)  (l  - 

^1  t-n 


vn-1 


0(F(t-n+l) 


s  .  x  vn-l 

where  0(x)  »  x(l  -  — — —  ) 

^l""t-n 
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Note 


(i  ■  *  Ts  ( 

i .  “  ") 

'  wv 

'  ^l"St-n  ' 

if  • 

•V8t-n 

To  Show:  x  <  —  — 

1  . .  n 

“  P~  “8 

By  definition  x  »  F(t-n+l)  £  1  ,  hence  we  need  only  show  — - — —  1  . 

Mn-St-n  h"t+n 

But  s  ^  t-n  implies  -  ^  1  since  t  £ 

u*n  n  n  j. 

by  assumption. 

Thus  we  can  substitute  for  F(t-n+l)  in  (})  the  bound  given 
by  (2)  with  i=«t-n  and  obtain 


F(t)  F(t-n)  (  1  -  ^ V.nL .  ) 

^1  t-n-1  y 


F(t-n)  (l  -  )' 

1  .  V  ^l~at-D-r 

LU  “8.  _ 


n-1 


^“Vn 


Noting  s.  ■  s  +  F(t-n)  and  letting  c  =  1  -  , 

t-n  t-n-1  h‘Bt-n-l 

the  above  inequality  becomes 


F(t)  ^  F(t-n)  c 


V  Li  -8. 


-I  n-1 


^rSt-n 
^’8t-n-l  J 


2  n-1 


F(t-n)  c  [  £■  1  -  cnF(t-n)  . 


This  is  (l)  for  k  =  n  ,  and  the  induction  is  complete. 


Theorem  2:  If  F(t)  is  a  lattice  IFft  distribution  with  mean 

00 

1  placing  mass  ^  at  k  «  0,1,2,...  and  F(t)  •  X  p  ;  then 

k«t 
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F(t)  * 


C"1  ) 

for 

t  <  pL_ 

Vl^  / 

m‘ki! 

for 

t  ] 

1+ul'[Ml] 

0 

for 

t  >  (uj 

Where  [p.^]  denotes  the  greatest  integer  contained  in  .  The 
inequality  is  sharp. 

Proof:  Since  log  F(k)  is  concave  in  k  and  noting  figuie  2.1 

for  i-0  ,  we  see  that 


log  F(k)  L* 

1  k 

or  F(k)  <.  [F(l)] 

°°  oo  k 

Thus  ^  F(k)  <.  [?(!)]  - 


(k)  or  F(l ) 


k-1  k=l 


1+i^ 


l-F(l) 


Now  consider  (l)  with  k*t-l  ;  this  gives 


F(t)  ^F(l) 


1  - 


rr- 


iKf(i))  . 


Note  that  ♦'(x)  •  (l-  ^)t"2(l-  i  0  if  ti^  and  ^  1 

(since  x  £  l)  ,  and  thus  from  (4),  we  have 


F(t)  ^  ^ 


for 
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This  bound  for  t  £  p.  Is  attained  by  the  geometric  distribution;  i.e. , 


1 

1-H^ 

\  "  1-hj^ 

;  k 

«012 

V  ^  •  I  •  • 

We  conjecture  that 

~  i ,-bi.  ] 

F(t)  £  < 

i  i 

for 

t  =  [p^]  +  1 

for 

t  >  [nj  +  1 

where  [ij^J  denotes  the  greatest  integer  contained  in  . 

For  t  >  Qx_  ]  +  1  ,  it  is  easily  seen  that  the  bound  is  attained  by 

t  -  [^1  +  1 
t  >  +  1 


The  upper  bound  on  1-F  for  the  continuous  distribution  when  F 
is  IFR  is  given  by  the  following  theorem. 

Theorem  3;  Let  F  denote  a  left  continuous  distribution.  Assume 
that  F  is  IFR  with  mean  ^  ,  then 


*  <L 

t  >  ^ 


where  w  depends  on  t  and  satisfies  1-w^  ■  e 
is  sharp  [2]  . 


The  inequality 


t 
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An  analogous  bound  exists  for  lattice  distrioutions  and  is  given  by 


Theorem  U :  Let  F  denote  a  lattice  distribution. 

F  is  IFR  with  mean  4^  ,  then 


F(t)  <1 


{ 


1 

t 

q. 


t  >  ^ 


where  q  depends  on  t 
is  sharp. 


and  satisfies 


^  • 


Proof: 


Let  us  define, 


k  «»  0,1,2, . . .  ,t 
k  >  t  . 


Assume  that 


The  inequality 
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Note  that  log  G(k)  is  linear  and  log  F(k)  is  concave, 
hence  F(k)  can  cross  G(k)  at  most  once  for  k  <  t  . 

If  t  >  p^  ,  we  can  always  determine  q  so  that 


00 

I  500  -  I*! 


or 


k^l 

t 

I 

k«*l 


or 


.  t 

1  -q  . 

q(^—  )  ■  p^  which  is  solvable  for  q  . 


Since,  F(k)  and  G(k)  are  distribution  functions  with  the  same 

mean  p.^  ,  they  must  cross  at  least  once  for  0  <  k  £  t  . 

_ 

Thus,  for  this  choice  of  q  ,  F(k)  necessarily  crosses  q 

— .  .  t 

exactly  once  from  above  for  0  <  k  <  t  .  Therefore,  F(t)  <  q 
unless  F  coincides  identically  with  G  .  The  degenerate  distribution 
concentrating  at  p.^  provides  the  upper  bound  for  t  £  p^  when 
p^  is  an  integer. 

Since  G  is  also  IFR,  the  bound  is  sharp. 


2. 2  DFR  Bound : 

It  is  shown  in  [2]  that  the  Bharp  lower  bound  on  1-F  where  F 
is  a  left  continuous  DFR  distribution  with  known  mean,  say  p^  ,  is  zero 
and  the  sharp  upper  bound  is  given  by  the  following  theorem. 

Theorem  5:  Let  F  be  left  continuous.  Assume  that  F 

is  DFR  with  mean  p^  ,  then 
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< 


t 

e^l 


The  inequality  is  sharp. 

Similarly,  for  lattice  distributions  the  sharp  lower  bound  on 
1-F  where  F  is  a  lattice  DFR  distribution  with  known  mean  p1 
is  zero.  For  exsmple,  consider 

[  0  ,  k<  0 

G(k)  -  < 

I  aqk  ,  k-0,1,2,.... 


where  a  is  arbitrarily  small  and  0  £  q  £  1  .  Choose  q  such  that 
G  has  mean  p  .  Then  G  is  DFR  with  a  Jump  at  the  origin  and 
G(k)  £  a  for  all  k  ■  0,1,2,...  .  Letting  a  -*•  0  ,  we  see  that  the 
sharp  lower  bound  is  zero. 

Thus,  for  lattice  DFR  distributions  only  the  upper  bound  on 
F(t)  is  given. 


Theorem  6:  Let  F  be  a  lattice  distribution.  Assume  that  F  is 

DFR  with  mean  p^  ,  then 


F(t)<. 


A). 

^t-r ' 


*  < 

t>.^1  • 


The  inequality  is  sharp. 
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Proof:  Ey  definition,  F(k)  ■  2  p  . 

We  remark  that  the  function  log  F(k)  is  defined  only  for  integer 
values.  By  linear  interpolation,  we  can  define  log  F(k)  for  any 
real  value  of  k  as  shown  in  figure  2.  3. 

Since  we  assume  that  F  is  DFR,  it  follows  that  log  F(k)  is 
convex,  and  therefore,  there  exists  a  supporting  line  to  log  F(k), 
say, at  k  ■  t  . 


-Ik- 


? 


Let  a  denote  the  intersection  of  the  supporting  line  with  y-oxis 


and  let  L  a  log  F(t)  .  Obviously,  L  <  G  £  0  and  the  equation  of 


L-a,k 


supporting  line  is  y  =  (-r~)  +  G  .  Thus  we  have, 


log  F(k)  ^  (~^)k  +  a  ;  k  >  0 


F(k)  ^  e 


(^)k+a 


Hence,  ^ 


ak+a 


where  a 


a(k-l) 


since 


e  <  1  • 


That  is 


^1  ^  ~ 
e  -1 


Solving  for  L  ,  we  obtain 


L  a  -  t  log  (l  +  —  )  . 

n. 

a 

£ 

Let  0(a)  ■  a  -  t  log(l  +  —  )  so  that 


L  1  0(a) 

L  £  max  0(a)  . 

L<a^O 


for  some  a  ,  L  <  a  ^  0  , 
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Note  that 


0'  (or)  -  1  - 


~o 

e  Ul+  1 


and 


0"(a) 


[e”°M'1+  I]' 


<  o 


Since  0"(a)  <  0  ,  we  know  that  0(a)  is  strictly  concave  and  hence 
it  has  a  maxima,  say,  at  a  o  a*  . 

Thus,  from  (6)  we  have, 


(7)  L£0(a*) 


so  it  remains  to  obtain  a*  .  a*  can  be  found  by  equating  0'(a) 
to  zero;  that  is 


1 


t 

-a  . 
e  nx+  1 


-  0  . 


This  gives  us. 


a*  ■  log( 


for  t  ^  . 


Note  that  for  t  <  ^  , 

Also  we  have 

0"(a)  <  0  for  any  a  where  L  <  a  £  0  . 
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! 


n 

\ 


Thus,  for  t  <  }jl^  ,  we  have  0'(a)  >  0  and  0"(ot)  <  0  which  shows 
that  0  is  strictly  increasing  concave  function  and  hence  the 
maximum  of  0(a)  is  achieved  at  the  origin. 

That  is 


if  t  < 

if  . 


Using  (8),  the  expression  in  (7)  becomes, 


L<-  < 


0(0) 


0  [log  (:£“)] 


if  t<P1 
if  t  ^ 


or 


<, 


log(li  )* 

M1+1 


if  t  <  ^ 


logUt^r)(T")t3  if 


But  L  »  log  F(t)  ,  so  finally  we  have 


(9) 


F(t)  £  < 


vt-l/v  t 


if  t  <  ^ 
if  t  ^ 


We  observe  that  equality  in  (9)  for  t  <  is  attained  by  the 

1  /  ^1  K 

geometric  distribution  pi  =  - —  (- - )  ;  k  =  0,1,2,...  and  for 

k  i+p^ 
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t  ^  |i^  equality  In  (9)  is  attained  by  the  geometric  distribution 

“  |iq  k 

G  where  G(k)  ■  ! )  ;  k  ■  1,2,...  .  Hence,  the  inequalities 

given  by  (9)  are  sharp. 

2.  5  Comparison  of  the  Continuous  and  Lattice  Bounds : 

Comparing  Theorem  1  and  Theorem  2,  we  remark  that  for 
fixed,  the  continuous  upper  bound  is  everywhere  smaller  than  the 
lattice  upper  bound.  Similar  comparisions  hold  for  Theorem  5  and 
Theorem  6. 
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3.  Application  of  Bounds 


3.1  Motivation  to  Life  Table  Analysis: 

We  note  that  the  failure  rate  function,  r(t)  is  familiar 
in  other  branches  of  science  and  is  known  by  a  variety  of  names. 

In  the  life  sciences  it  is  used  by  actuaries  under  the  name  of 
"Force  of  Mortality"  to  compute  mortality  tables  [15 ].  The  important 
work  of  life  table  analysis  is  to  calculate  survival  rate,  i.e. , 
the  proportion  of  persons  surviving  a  specified  interval  of  time. 
Various  methods  for  calculating  survival  rate  have  been  discussed 
in  several  research  papers  by  Ederer  (7)>  Ederer,  Cutler  and  Axtell 
[8],  Berkson  and  Gage  [4],  etc.  Some  functions  that  have  been 
used  to  define  the  force  of  mortality  are  Gompertz's  formula  and 
Makeham's  formula.  A  detailed  discussion  concerning  the  estimation 
of  the  parameters  in  Makeham's  law  of  mortality  can  be  found  in 
Grenander ' s  paper  [9 ] . 

Grenander  remarks  in  his  paper  that  we  should  specify  carefully 
the  probability  assumptions  to  be  used  in  mortality  studies.  The 
basic  assumption  is  that  our  data  forms  a  sample  of  independent 
observations  cf  stochastic  variables,  with  the  same  probability 
distribution  except  when  sampling  errors  are  serious.  Further 
Grenander  remarks  in  his  paper  that  mortality  estimates  will  be 
used  to  compute  premiums,  reserves  and  so  on,  and  it  is  the  future 
mortality  that  is  of  interest  for  making  a  prognosis  for  several 
decades  forward  in  time.  Since  the  development  of  the  mortality 
depends  upon  factors  that  are  very  difficult  to  handle,  it  Is  likely 
that  only  a  rough  prognosis  can  be  made. 


We  are  interested  in  giving  upper  bounds  on  survival  probability 
which  will  be  useful  in  the  life  sciences  in  order  to  compare 
mortality  rates  in  different  populations.  To  give  such  bounds, 
which  can  be  used  for  all  ages,  we  need  to  have  some  a  priori 
information  about  the  failure  rate  function  such  as  IFR  or  DFR,  as 
we  shall  see  in  a  cancer  study. 

3.2  Application  of  bounds  to  a  cancer  study: 

Patient  survival  is  generally  accepted  as  the  principal  criterion 
for  measuring  the  effectiveness  of  treatment  in  cancer.  The  American 
College  of  Surgeons  requires  the  maintenance  of  a  cancer  registration 
and  follow  up  program  for  approval  of  a  hospital  cancer  program.  We 
shall  investigate  a  particular  type  of  cancer  from  a  large  number  of 
cancer  studies  given  in  [6]  in  order  to  apply  the  bounds. 

We  are  interested  in  giving  upper  bounds  for  the  survival 
probability  (also  called  survival  rate).  Data  of  table  1  is  taken 
from  [6,  p.  31f  ]>  which  represents  the  observed  survival  rates 
for  breast  and  genito -urinary  organs  cancer  for  the  period  1942-56 
of  both  sexes.  For  this  particular  type  of  cancer  there  were  41,157 
patients  at  the  beginning  of  the  year  1942.  The  observed  survival 
rates  were  calculated  by  the  usual  formula  and  is  shewn  in  Table  1. 

Our  aim  is  to  use  this  data  to  give  an  upper  bound  for  the 
survival  rate  and  to  extend  it  for  all  ages. 
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TABLE  -  1 


Observed  Survival  Rates, 

Breast  and  Genito-Urinary  Organs  Cancer, 
Detailed  Sites  By  All  Ages,  1942  -  1956,  Both  Sexes. 


Years  after 
diagnosis. 

Observed  Survival 
rates  in  percentage. 

A  A 

Difference 

t 

F(t) 

Log/(t) 

k*SeF(t+l)-LogeF(t) 

0 

100.0 

4.60517 

-  0.33407 

1 

71.6 

4. 27110 

-  0.18849 

2 

59.3 

4.08261 

-  0.13715 

3 

51-7 

3.94546 

-  0.11682 

4 

46.0 

3. 82864 

-  0.09814 

5 

41.7 

3.73050 

-  0.09291 

6 

38.0 

3.63759 

-  0.08224 

7 

35.0 

3.  55535 

-  0.07719 

8 

32.4 

3. 47816 

-  0.07696 

9 

30.0 

3.  40120 

-  0.07616 

10 

27.8 

3.  32504 

-  0.07080 

11 

25.9 

3.25424 

-  0.06789 

12 

24.2 

3.18635 

-  0.07283 

13 

22.5 

3.11352 

-  0.06425 

14 

21.1 

3. 04927 

-  0.05354 

15 

20.0 

2.99573 

Since  successive  differences  of  the  function  LogeF(t)  are 

increasing  in  t  except  at  t  =  12  ,  we  can  assume  that  the  successive 

differences  of  the  function  LogeF(t)  are  increasing  in  t  for  all 

a 

values  of  t  .  The  graph  of  the  function  LogeF(t)  is  shown  in 
Figure  5.1. 
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YEARS  AFTER  DIAGNOSIS  ,  t 


FIGURE  3.1 


From  successive  differences  of  the  function  logeF(t)  and  from 

the  graph  it  is  clear  that  logeF(t)  is  convex  and  therefore  we  claim 

that  F  is  DFR. 

Now  we  shall  obtain  upper  bounds  for  1-F  using  Theorem  5  and 
Theorem  6  and  compare  them. 

We  need  to  estimate  (i,  ,  the  mean  of  F  .  Let  denote  an 
estimate  of  ^  .  Thus, 

00 

4  ■  Z f(t) 

t=»i 

14  00 

-  z f(t)  + 

t-1  t>  14 

»  ,  say. 


From  the  data  of  Table  1,  we  have 


V4  - 

^ "  2, F(t) "  527-2  • 
t»i 


Hence,  it  remains  to  calculate  .  From  the  graph  of  the  function 

CX.  _ 

loggF(t)  it  is  clear  that  for  t  ^  8  ,  F(t)  has  an  exponential  form 
and  therefore,  ^  can  be  approximated  by 


00 


*  L  F(t) 

t>  14 


00 


14 


g(x)dx  where  g(x)  is  an  exponential  curve. 


Let  g(x)  ■  ce 


■a(x-l4)  for  x  ^  14  . 


So.  log  g(x)  -  log  c  -  a(x-l4)  implies  log  c  is  equal  to  the  height  at 

*  P  P 
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x  ™  14  and  -a  1b  the  slope  of  the  curve  log  g(x)  .  This  gives 

e 

c  ■  21,1  and  an  estimate  of  -a  is  given  by 


-a  - 


LogeF(9)  -  logeF(l2) 
9-12 


0.07162  . 


Hence, 


^1  "  I  k  (21*1)e0,07l62(X"l4)^ 

-  (21.1)/We°*°7l62dy 


294.6  . 


Finally,  we  obtain 


^  -  527.2  +  294.6 

-  821.8 


or  yesLrs  . 

Using  an  estimate  of  the  mean  of  F  as  8. 2l8,  we  shall 
calculate  upper  bounds  for  1-F  in  Table  2. 
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TABLE  -  2 


Upper  Bounds  on  Survival  Rate. 


Years  after 
diagnosis. 

t 

Observed  survival 
rates  in  . 

F(t) 

Upper  bounds  on  F(t) 
using  Th.  6  in  $  . 

Upper  bounds  on  F(t) 
using  Th.  5  in  $  . 

0 

100.0 

100.00 

100.00 

1 

71.6 

89.15 

88.54 

2 

59.3 

79.47 

78.37 

3 

>1.7 

70.85 

69.42 

4 

46.0 

63.16 

61.47 

5 

41.7 

56.31 

54.42 

6 

3Q.0 

50.20 

48.19 

7 

35.0 

44.75 

42. 66 

8 

32.4 

39.89 

37.78 

8.218 

31.9 

38.94 

36.79 

(By  Interpolation) 

9 

30.0 

35.55 

33.58 

10 

27.8 

31.80 

30.23 

11 

25.9 

28.79 

27.48 

12 

24.2 

26.29 

25.18 

13 

22.5 

24.16 

23.24 

14 

21.1 

22.39 

21.58 

15 

20.0 

20.82 

20.13 

From  the  last  two  columns  we  observe  that  the  upper  bounds  on 
F(t)  obtained  by  Theorem  6  are  everywhere  larger  than  those  obtained 
by  Theorem  5.  These  bounds  can  be  compared  graphically  as  shown  in 
Figure  3.2  and  Figure  3.3. 
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SURVIVAL  RATE  IN  PERCENTAGE,  Y 


-27- 


-28- 


These  results  on  bounds  are  of  particular  interest  in  comparing 
the  mortality  in  different  populations,  when  a  priori  information  about 
the  mortality  function  is  known. 

Suppose  we  are  interested  in  comparing  the  survival  rates  of 
two  types  of  cancer,  say,  stomach  and  lung.  Let  us  assume  that  both 
have  decreasing  failure  rate  with  mesm  jjl^  and  respectively.  We  note 
that  one  of  the  means  is  greater  than  the  other  say,  p^  >  p^  .  Also 
note  the  the  estimate  of  the  means  can  be  found  by  studying  smaple 
information.  Using  Theorem  6,  we  can  find  upper  bounds  on  survival 
rates  for  both  stomach  and  lung  cancer.  Then,  by  comparing  the  upper 
bounds  cn  stomach  and  lung  cancer,  we  can  estimate  the  greatest  difference 
in  survival  rate  for  different  periods. 

3.3  Application  of  bounds  to  reliability  theory: 

In  reliabil  tty  theory,  a  decreasing  failure  rate  would  seem 
to  correspond  to  some  physical  mechanism  of  improvement  with  age, 
so  that  the  longer  the  unit  survives  the  less  the  chance  of  failure 
in  the  next  unit  of  time.  We  will  discuss  the  reliability  problem 
given  in  [13]  by  establishing  the  upper  bound. 

Proschan  was  interested  in  obtaining  information  as  to  the 
distribution  of  failure  intervals  for  the  air  conditioning  system 
of  each  member  of  a  fleet  of  Boeing  720  Jet  airplanes  [13].  Records 
were  kept  for  the  time  of  successive  failures  (to  the  nearest  hour) 
of  the  air  conditioning  system  [13,  p.  3163. 

First  we  note  that  it  was  proved  in  [13]  that  the  failure  distribution 

formed  by  pooling  failure  intervals  for  a  given  component  operating 

in  different  types  of  equipment  would  be  DFR.  Thus,  we  know  that 

the  data  follows  a  DFR  distribution  and  hence  we  will  give  an 
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upper  bound  on  survival  probability  using  Theorem  6.  We  remark  that 
an  estimate  of  the  parent  mean  can  be  estimated  by  sample  information 
and  it  was  found  in  [13]  that  the  mean  of  all  the  failure  intervals 
was  93.1^.  Since  the  sample  size  is  large,  this  estimate  should  be 
fairly  accurate.  Using  93.1^  as  an  estimate  of  the  pareat  mean, 
we  give  theoretical  upper  bound  for  the  pooled  data. 

Figure  shows  the  upper  bounds  on  survival  probability. 

The  cross-over  in  the  beginning  for  the  observed  survival  probability 
may  be  due  to  random  fluctuation. 
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TIME  UNTIL  FAILURE,  X.  Hours 
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